# GGUF Format

Neobert GGUF
MIT
This is a static quantized version of the chandar-lab/NeoBERT model, aiming to reduce model storage space and computational resource requirements.
Large Language Model Transformers English
N
mradermacher
219
1
Qari OCR 0.3 SNAPSHOT VL 2B Instruct Merged GGUF
This is a statically quantized version based on the Qari-OCR-0.3-SNAPSHOT-VL-2B-Instruct-merged model, mainly used for image-to-text conversion tasks.
Image-to-Text Transformers English
Q
mradermacher
188
0
Qwen2 Audio 7B Instruct GGUF
Apache-2.0
Static quantized version of Qwen2-Audio-7B-Instruct model, supporting English audio-to-text conversion tasks
Audio-to-Text Transformers English
Q
mradermacher
146
0
Vintern 1B V3 5 GGUF Ext
MIT
Vintern-1B-v3_5 is a 1-billion-parameter vision-language model supporting image-text generation tasks.
Text-to-Image
V
rootonchair
242
1
Wan2.1 T2V 14B CausVid GGUF
Apache-2.0
This is a GGUF format conversion version based on the Wan-AI/Wan2.1-T2V-14B model, primarily used for text-to-video generation tasks.
Text-to-Video English
W
Njbx
190
0
INTELLECT 2 GGUF
INTELLECT-2-GGUF is the GGUF format quantized version of PrimeIntellect/INTELLECT-2, suitable for text generation tasks.
Large Language Model
I
MaziyarPanahi
88
1
Clinician Note 2.0a I1 GGUF
Clinician-Note-2.0a is a text generation model specialized in the medical field, specifically designed for clinical note and summarization tasks.
Large Language Model English
C
mradermacher
352
0
Skyreels V2 I2V 14B 540P GGUF
Other
SkyReels-V2-I2V-14B-540P is a GGUF-format converted image-to-video model that supports generating dynamic video content from static images.
Video Processing
S
wsbagnsv1
929
8
Orpheus TTS Turkish PT 5000 Q5 K M GGUF
MIT
This is a Turkish text-to-speech (TTS) model based on the Orpheus architecture, converted to GGUF format for use with llama.cpp.
Speech Synthesis Other
O
Karayakar
45
0
Kokoro GGUF
MIT
Kokoro is a text-to-speech (TTS) model offering GGUF-encoded versions with dual phonemization support.
Speech Synthesis
K
mmwillet2
749
1
News Summarizer T5 GGUF
Apache-2.0
This is a statically quantized version of a T5-based news summarization model, supporting English text summarization tasks.
Text Generation English
N
mradermacher
167
0
Gemma 3 12b It GGUF
GGUF quantized version of Gemma 3 12B, suitable for text generation tasks.
Large Language Model
G
MaziyarPanahi
641.41k
4
Gemma 3 4b It GGUF
GGUF quantized version of the Gemma 3B model, suitable for local text generation tasks
Large Language Model
G
MaziyarPanahi
358.91k
6
Greek Text Summarization GGUF
Apache-2.0
Static quantized version based on kriton/greek-text-summarization, specialized for Greek text summarization tasks
Text Generation Other
G
mradermacher
216
0
VISION 1 GGUF
VISION-1 is a content moderation and safety classification model based on transformers, focusing on text classification tasks.
Text Classification English
V
mradermacher
153
2
Parler Tts Mini V1 GGUF
Apache-2.0
GGUF format model file of Parler TTS Mini v1 for text-to-speech tasks, supporting the English language.
Speech Synthesis English
P
ecyht2
198
4
Bge Reranker Base Q8 0 GGUF
MIT
This model is converted from BAAI/bge-reranker-base into the GGUF format, primarily used for text reordering tasks.
Text Embedding Supports Multiple Languages
B
xinming0111
106
1
Glm Edge V 5b Gguf
Other
Glm-Edge-V-5B-GGUF is a multilingual image-text generation model supporting Chinese and English, developed based on the GLM architecture.
Large Language Model Supports Multiple Languages
G
THUDM
486
7
T5 Large Q4 K M GGUF
Apache-2.0
This model is a GGUF-converted version of google-t5/t5-large, supporting tasks like summarization and translation, and is applicable to multiple languages including English, French, Romanian, German, and more.
Large Language Model Supports Multiple Languages
T
tianlp
16
0
Chromafur Alpha Gguf
Other
ChromaFur Alpha is a text-to-image generation model converted to GGUF format, suitable for low-end GPUs or users who prefer fast loading.
Image Generation
C
WWizrd
13
1
Yi Coder 1.5B Chat GGUF
Yi-Coder-1.5B-Chat-GGUF is the GGUF format model file of 01-ai/Yi-Coder-1.5B-Chat, suitable for text generation tasks.
Large Language Model
Y
MaziyarPanahi
254.78k
10
FLUX.1 Dev Q8 Fp16 Fp32 Mix 8 To 32 Bpw Gguf
Other
Experimental GGUF-converted version of Flux.1-dev, featuring various mixed-precision quantization schemes
Text-to-Image
F
mo137
257
12
Phi 3.5 Mini Instruct GGUF
GGUF format model file for Phi-3.5-mini-instruct, suitable for text generation tasks.
Large Language Model
P
MaziyarPanahi
335.88k
13
Madlad400 3b Mt Q8 0 GGUF
Apache-2.0
MADLAD-400 3B multilingual translation model, supporting translation tasks for over 400 languages, optimized in GGUF format.
Machine Translation Supports Multiple Languages
M
mtsdurica
47
1
Yes
Apache-2.0
This is a Llama3 model fine-tuned on the PubMedQA dataset, specializing in medical Q&A tasks.
Large Language Model English
Y
ThorBaller
22
1
Gemma 2b It GGUF
Other
GGUF quantized version of the Gemma 2B model, suitable for local deployment and inference
Large Language Model
G
MaziyarPanahi
517
10
Sqlcoder 7b 2 GGUF
This is the GGUF format quantized version of the defog/sqlcoder-7b-2 model, primarily used for SQL code generation tasks.
Large Language Model
S
MaziyarPanahi
346
10
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase